
cp: disable CG for 8B SFT (2508) into r0.3.0#2513

Closed
svcnvidia-nemo-ci wants to merge 1 commit into r0.3.0 from cherry-pick-2508-r0.3.0

Conversation

Contributor

@svcnvidia-nemo-ci svcnvidia-nemo-ci commented Feb 24, 2026

beep boop [🤖]: Hi @malay-nagda 👋,

we've cherry-picked #2508 into r0.3.0 for you! 🚀

Please review and approve this cherry pick at your convenience!

Summary by CodeRabbit

  • Chores
    • Updated Llama3 8B finetune preset configurations for improved performance.

Signed-off-by: Malay Nagda <malayn@nvidia.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
@svcnvidia-nemo-ci
Contributor Author

/ok to test 4666f54


copy-pr-bot bot commented Feb 24, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Contributor

coderabbitai bot commented Feb 24, 2026

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between c106d06 and 4666f54.

📒 Files selected for processing (1)
  • scripts/performance/configs/llama/llama3_workload_base_configs.py

📝 Walkthrough

Walkthrough

Two Llama3 8B finetune SFT presets have their cuda_graph_impl parameter changed from "transformer_engine" to "none" with a comment indicating CUDA Graphs reduce performance in this configuration context.

Changes

Cohort: Configuration Parameter Updates
File: scripts/performance/configs/llama/llama3_workload_base_configs.py
Summary: Modified cuda_graph_impl from "transformer_engine" to "none" in two Llama3 8B finetune SFT presets, since CUDA Graphs were observed to reduce performance in this configuration.
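The diff itself is a one-value toggle. As a rough illustration of what such a preset change might look like (the field name cuda_graph_impl comes from the PR summary; the dataclass and preset names below are hypothetical stand-ins, not the actual NeMo config code):

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class FinetunePreset:
    """Illustrative stand-in for a Llama3 8B SFT performance preset."""
    name: str
    cuda_graph_impl: str  # e.g. "transformer_engine" or "none"


# Before the change: the 8B SFT presets used Transformer Engine CUDA Graphs.
llama3_8b_sft = FinetunePreset("llama3_8b_sft", cuda_graph_impl="transformer_engine")

# After the change: CUDA Graphs disabled, since they were observed to
# reduce performance for this configuration.
llama3_8b_sft = replace(llama3_8b_sft, cuda_graph_impl="none")

print(llama3_8b_sft.cuda_graph_impl)  # -> none
```

The change leaves everything else in the presets untouched, which is consistent with CodeRabbit's "Trivial" review-effort estimate.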

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~3 minutes

🚥 Pre-merge checks | ✅ 3 | ❌ 1

❌ Failed checks (1 warning)

Check name: Test Results For Major Changes
Status: ⚠️ Warning
Explanation: This performance-related change disabling CUDA Graphs lacks the required before-and-after performance metrics and quantitative documentation.
Resolution: Add performance benchmark results comparing the two configurations with specific metrics (throughput, latency), test environment details, and a reference to the PR #2508 justification.
✅ Passed checks (3 passed)
  • Description Check — Passed. Check skipped: CodeRabbit's high-level summary is enabled.
  • Title Check — Passed. The title clearly indicates the primary change: disabling CG (CUDA Graphs) for 8B SFT configs in the r0.3.0 branch, which matches the actual modification of cuda_graph_impl settings.
  • Docstring Coverage — Passed. No functions were found in the changed files, so the docstring coverage check was skipped.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.



@ko3n1g
Contributor

ko3n1g commented Mar 3, 2026

Merge via #2509

@ko3n1g ko3n1g closed this Mar 3, 2026
@malay-nagda
Contributor

Added to the release branch with #2527.

Closing this.

